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ABSTRACT 

Large catalogs of shear-selected peaks have recently become a reality. In order to properly interpret the 
abundance and properties of these peaks, it is necessary to take into account the effects of the clustering of 
source galaxies, among themselves and with the lens. In addition, the preferred selection of lensed galaxies 
in a flux- and size-limited sample leads to fluctuations in the apparent source density which correlate with 
the lensing field {lensing bias). In this paper, we investigate these issues for two different choices of shear 
estimators which are commonly in use today: globally-normalized and locally-normalized estimators. While 
in principle equivalent, in practice these estimators respond differently to systematic effects such as lensing bias 
and cluster member dilution. Furthermore, we find that which estimator is statistically superior depends on the 
specific shape of the filter employed for peak finding; suboptimal choices of the estimator+filter combination 
can result in a suppression of the number of high peaks by orders of magnitude. Lensing bias generally acts to 
increase the signal-to-noise v of shear peaks; for high peaks the boost can be as large as At/ wl-2. Due to 
the steepness of the peak abundance function, these boosts can result in a significant increase in the abundance 
of shear peaks. A companion paper (Rozo et al. 2010a) investigates these same issues within the context of 
stacked weak lensing mass estimates. 

Subject headings: cosmology: clusters, weak lensing, large scale structure 



1. INTRODUCTION 

The abundance of rare massive objects (clusters) in the 
Universe has emerged as a powerful probe of cosmology 
(Vikhlinin et al. 2009; Rozo et al. 2010b; Mantz et al. 2009; 
Henry et al. 2009; Vanderlinde et al. 2010). Many different 
techniques can be used to find these clusters, such as optical 
identification, X-rays, Sunyaev-Zeldovich effect, and gravita- 
tional lensing. Among these, lensing stands out as being the 
least sensitive to the complicated baryonic physics that gov- 
ern galaxy formation and the non-thermal processes that af- 
fect the dynamics of the intra-cluster medium. Consequently, 
the lensing signal is expected to be the easiest to predict, a 
fact that has fostered great interest in developing weak lensing 
cluster finders (Schneider 1996; Hennawi and Spergel 2005; 
Hamana et al. 2004), and has even lead to the publication of 
several lensing selected cluster samples (Miyazaki et al. 2007; 
Wittman et al. 2006; Gavazzi and Soucail 2007). In practice, 
lensing selection suffers from significant projection effects: 
the lensing signal of a cluster can be enhanced by a favorable 
projection of a triaxial halo; by associated mass distributions 
(substructure filaments); or by the chance superposition of 
large-scale structure along the line of sight. Fortunately, such 
superpositions can be calibrated by relying on numerical sim- 
ulations, and weak lensing peak statistics remain a promising 
probe for cosmological physics (Wang et al. 2009; Dietrich 
and Hartlap 2009; Kratochvil et al. 2009; Marian et al. 2010). 

Weak lensing shear measurements use the shapes of dis- 
tant galaxies in order to statistically extract the lensing signal. 
To date, most work has assumed that source galaxies are ran- 
domly distributed, whereas in practice we know galaxies are 
clustered. The apparent clustering receives two contributions: 
one from intrinsic (physical) clustering of the galaxies, hence- 
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forth referred to as source clustering; the other from fluc- 
tuations in the galaxy density induced by lensing itself, via 
lensing bias (also known as magnification bias; Broadhurst 
et al. 1995; Schmidt et al. 2009a). The intrinsic clustering of 
source galaxies acts to increase the noise in the shear mea- 
surement, while the lensing-induced fluctuations can bias the 
shear measurement if they are not properly accounted for. In 
fact, lensing bias can be seen as a probe of weak gravitational 
lensing in its own right (Schneider et al. 2000; Van Waerbeke 
et al. 2010, Rozo and Schmidt, in preparation). 

In a companion paper (Rozo et al. 2010a, henceforth re- 
ferred to as paper I), we study similar issues for stacked weak 
lensing analyses of groups and clusters. The two papers are 
highly complementary, as paper I focuses on high signal-to- 
noise weak lensing mass calibration of objects that have been 
previously identified and then stacked, whereas in this work 
we focus on identifying shear peaks in the low signal-to-noise 
regime. 

Following paper I, we discuss two possible filtered shear 
estimators, which differ in the way the estimator is normal- 
ized. The first estimator uses a fixed normalization (e.g., Ma- 
turi et al. (2005); Wang et al. (2009)), and is therefore sensi- 
tive to the overall modulation of the source density field by 
the lensing signal. The second estimator uses an individual 
normalization for each point in the sky (e.g., Miyazaki et al. 
(2007)). In this approach, the fluctuations in the density of 
background galaxies are partially canceled out, which reduces 
noise, but also shrinks the extra signal due to magnification. 
Moreover, a location-based normalization estimator can lead 
to dilution of the lensing signal if the source population is con- 
taminated by cluster galaxies (see paper I for a more detailed 
discussion). 

Throughout, we adopt a fiducial flat ACDM cosmology 
with h = 0.7, Cl m = 0.28, n s = 0.96, and a power spectrum 
normalization of us = 0.85 at z = 0. All masses are defined 
as M2oom> i- e - enclosing an average density of 200 times the 
mean matter density. The source galaxies are assumed to fol- 
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low the redshift distribution expected for the Dark Energy 
Survey (DES) 4 , 

— ocz 2 exp[-(z/zc/] (1) 

with zo = 0.5 and /3 = 1.4, and we assumed a density of h = 
20 arcmin" 2 . This source density is slightly larger than that 
expected for DES, but smaller than that of other future surveys 
such as Hyper Suprime-Cam or the Large Synoptic Survey 
Telescope (LSST). 

Section 2 presents the shear estimators, and discusses how 
lensing bias impacts each of these estimators in turn. We also 
discuss the choice of filter function and optimal filter scale. 
Section 3 presents the results on the statistics of lensing peaks. 
We conclude in Section 4. Details on the lensing calculations, 
and two derivations regarding the variance of smoothed shear 
filters have been relegated to the appendix. 

2. SMOOTHED SHEAR ESTIMATORS 
In this section, we focus on estimators of the form 



A{0)=\Y^emdi-0). 



(2) 



where the sum is over all galaxies 5 , n is the mean source 
galaxy density, e, is the tangential component of the ellipticity 

of galaxy i (with respect to the relative position 9,-9), and W 
is an arbitrary filter which we assume is unity-normalized, 



d 2 9W(9) = 1. 



(3) 



For the time being, we leave the filter unspecified. The es- 
timator Eq. (2) has a fixed normalization. We consider the 
alternative choice of a varying normalization in Section 2.3. 

In order to derive the statistical properties of A, we proceed 
as follows: first, we divide the sky into infinitesimal pixels 
of area Ail such that the number of galaxies in each pixel is 

either or 1 . Letting n{9) be the galaxy density field on the 
sky, we can rewrite Eq. (2) as 



A(9): 



n — ^ 



(4) 



where the sum is now over all pixels, and the index i denotes 
that the quantity of interest is evaluated at 9j. For example, 
Hi = n(9j). We now set 

«,=«(l + ( 5,) (5) 

where Si is the galaxy density fluctuation (in this section, we 
assume the fluctuations are purely Poisson). Assuming (e,) = 
gi where g = 7/(1 — k) is the reduced (tangential) shear, we 
find that the expectation value of A is given by 

(A(9)) = J^SiWAQ = I d 2 9'g(9')W{\9-9'\). (6) 

For the second equality, we have let Af2 — > (continuum 
limit), and correspondingly replaced ^ ,Af2 with J d 2 9. Fur- 
ther, we have assumed that the source galaxy overdensity S is 

4 http://www.darkenergysurvey.org/ 

5 Note that for simplicity, and in order to keep results general, we have not 
included any galaxy weights, which in practice will be used in particular if 
photometric redshifts are available. 



uncorrected with the shear. In the following discussion, we 
will set 9 = without loss of generality. We can compute the 
variance of A in a similar fashion. In particular, using Eq. (4) 
and Eq. (5) we find 



A 2 = + SiSj^iejWiWjiAQ) 2 



(7) 



where have ignored terms proportional to S since these will 
go to zero upon taking the expectation value. Neglecting any 
clustering of the source galaxies for the moment, we have 



(8) 



while the expectation value of the galaxy ellipticities takes the 
form (recall that e, stands for the tangential component of the 
ellipticity) 

2 

gig j+Y Si *' (9) 

Here, a e denotes the RMS (intrinsic) galaxy ellipticity of the 
sample. Using these expressions, taking the expectation value 
of Eq. (7), and subtracting off (A) 2 , we find 

Var(A) =\Jd 2 9 W 2 (9) (£■ +g 2 (0)^ . (10) 

In many cases, \g\ <C cr e ~ 0.3 and the term g 2 (9) in Eq. (10) 
is often neglected. In that case, 



Var(A) = 



2/7 



d 2 9W 2 (9) = 



1 



In 47t9 2 ' 



(11) 



where for the second equality we have assumed a Gaussian 
shear filter of width [Eq. (12)], which is the standard result 
for the variance of A (van Waerbeke 2000). 

2.1. Filter Functions 

A variety of smoothing kernels have been proposed for av- 
eraging the shear, including top-hat, aperture mass (Schneider 
et al. (1997)), and matched filters (Maturi et al. (2005); Mar- 
ian and Bernstein (2006)). Some differences in the filters are 
due to the various goals they were designed to achieve; for 
example, to reduce contributions from small scales, or large 
scales, or the mass-sheet degeneracy present in shear mea- 
surements. 

In this paper, we will discuss three commonly used or pro- 
posed filters, which are shown in Fig. 1. It is instructive to 
compare this figure with Fig. 8 in the appendix, which shows 
the angular scales contributing to the lensing signal. Our first 
and perhaps simplest choice is a Gaussian shear filter, 



W(6) 



1 



27t6 



jexp 



9 2 
~26 2 



(12) 



where denotes the filter scale, which can be chosen to max- 
imize signal-to-noise for a given lens mass and redshift. This 
filter is also similar in shape to the filter presented in Maturi 
et al. (2005) designed to reduce the contribution from unasso- 
ciated large-scale structure. 

Another choice is a filter constructed using the method 
of Kaiser and Squires (1993) to yield an estimator of the 
Gaussian-smoothed convergence k ("KS-Gaussian"). This is 
equivalent to searching for peaks in smoothed convergence 
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Figure 1. Shear filter functions considered in this paper, as a function of 
angular separation: Gaussian-smoothed shear filter; Gaussian- smoothed con- 
vergence filter using the method of Kaiser and Squires (1993); and matched 
NFW filter (Marian and Bernstein 2006). 



where g is the reduced shear profile of an NFW halo (see Ap- 
pendix A), and the constant is determined from the normal- 
ization constraint Eq. (3). Note that as written, the filter is 
optimal for uniform noise and in the absence of lensing bias; 
it can easily be generalized to take into account the effects dis- 
cussed in this paper. In Marian and Bernstein (2006), the filter 
was truncated at the scale corresponding to the virial radius 
(or R200) of the halo (i.e. W = for 9 > # max ); in general, one 
can vary the truncation scale of the filter. Fig. 1 shows that the 
matched-NFW filter (with truncation at /?2oo) is in fact quite 
similar to the Gaussian shear filter. 

We will return to the choice of filter and optimal filter scale 
6 in Section 2.5. 

2.2. Lensing bias and Source Clustering 

In the presence of foreground lensing matter, the num- 
ber density of a flux- and size-limited sample of background 
galaxies is affected by two effects: galaxies get pushed over 
the flux and/or size threshold by lensing magnification, and 
their density is diluted because the observed patch of sky is 
stretched (Broadhurst et al. (1995)). These two effects com- 
bine in such a way that the observed source galaxy density 
field is related to the unlensed background density field via 6 

n ob M = n[l+5 g (8)M8)' l/2 (15) 
where the magnification fx is given by 7 



maps (e.g., Hamana et al. 2004; Wang et al. 2009, but see 
caveats below). The shear filter function is given by: 



W(8) = 



1 



1- 



1 + 29^' eXP 



e 2 

~29 2 



(13) 



where 9 is the width of the Gaussian convergence filter. This 
filter is usually used by pixelizing the sky in patches of a 
few arcmin 2 area, and measuring the average shear in each 
pixel. The resulting shear map is then convolved with the 
KS-Gaussian filter. While we do not explicitly consider pix- 
elized estimators here, under certain conditions the globally- 
normalized estimator Eq. (2) is equivalent to a pixelized es- 
timator: this is the case if either the shear in each pixel is 
estimated with a global normalization (such as in Eq. (2)), 
and pixels are weighted equally; or if the shear in a pixel is 
measured with a local normalization, and each pixel is then 
inverse-variance weighted, i.e. weighted by the number of 
galaxies in the pixel, in the further analysis. 

Note that this filter is not normalizable through Eq. (3). In- 
stead, the normalization is defined through the convergence 
filter. Due to the non-local relationship between shear and 
convergence, this filter is quite different from the Gaussian 
shear filter. As shown in Fig. 1, small scales are down- 
weighted, while in principle arbitrarily large scale scales are 
weighted equally (6 2 W ~ const). This is a consequence of the 
mass-sheet degeneracy present in the shear, which can only be 
broken by including very large scales. We will see that due to 
these differences, a shear estimator using the KS-Gaussian fil- 
ter behaves quite differently than an estimator using the other 
filters discussed here. 

Finally, one can choose a filter matched to the expected sig- 
nal of a Navarro-Frenk- White (NFW, Navarro et al. (1996)) 
lensing halo. Following Marian and Bernstein (2006), we 
choose 

W(8) = Cg(8), (14) 



(l-/c) 2 -|7p 



1+2k + 3k 2 +|7| 2 + . 



(16) 



and q characterizes the contributions from magnification and 
size bias (Schmidt et al. (2009a,b)). Specifically, the parame- 
ter q is given in terms of /?/ and /?,-, the logarithmic slopes of 
the flux and size distributions, as: 



q = 2P f +p r -2 



d\nn ohs 



d\nf 



f=fm 



d\nn obs 



d\nr 



(17) 
(18) 



Here / denotes flux and r stands for apparent size of the galax- 
ies. For definiteness, we assume a value of q = 1 .5, in the mid- 
dle of the range estimated by Schmidt et al. (2009a). Consider 
now the expectation value of A in the presence of lensing bias. 
Inserting Eq. (15) into Eq. (4) and taking the expectation value 
we find 

' d 2 8 ^ /2 (8)g(8)W(8) 



(A) 



(19) 



where we have assumed the lensing field does not correlated 
with the source density field <5 ? . We see that the increase in 
the number of background sources (assuming q > 0) leads to 
a higher signal, as expected. 

We turn now to estimating the variance of A in the presence 
of magnification bias and source clustering. We assume that 
a weak lensing shear peak occurs when A is evaluated at the 
center of a lensing halo. For now, we assume that there is no 

6 In the weak lensing literature, it is customary to linearize the magnifica- 
tion term pfll 1 ss (1 -vqn). The parametrization Eq. (15) is the general expres- 
sion valid into the moderate lensing regime, see Broadhurst et al. (1995). 

7 In the following, we will ignore the fact that the source redshift distri- 
bution dN/dz itself depends on 8, and will always calculate fi assuming the 
average dN/dz. Since the lensing efficiency varies slowly with redshift, we 
expect such fluctuations to have negligible impact. 
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contribution to the lensing shear and magnification from other 
matter along the line of sight. The clustering of the source 
population which we neglected in Section 2 is an additional 
noise contribution to A. Using Eq. (15), we have 



(S t Sj) = 



1 



n fjj 



(20) 



where = — is the source galaxy angular correlation 
function. Here, we neglect higher moments of the angular 
distribution of source galaxies, which can become relevant on 
very small scales. In order to calculate £, we make another ap- 
proximation: we assume that galaxies follow the matter dis- 
tribution with a linear bias of ~ 1, and use the fitting formula 
of Smith et al. (2003) for the non-linear matter power spec- 
trum together with the redshift distribution Eq. (1) (see also 
appendix of paper I). Note that £ is the observed correlation 
function, which in principle is also modified by lensing bias 
induced by large-scale structure (Matsubara 2000; Hui et al. 
2007). Since this is a small (< 10%) correction to a usually 
subdominant noise contribution, we neglect this lensing bias 
here given our rough approximations for £. However, we do 
take into account that magnification modifies the source den- 
sity through lensing bias, and correspondingly affects the shot 
noise. 

The mean and variance of source ellipticities remain un- 
changed by lensing bias, so that as before (e,-) = g, and (e,-e_/) = 

T% + S^- 
Putting everything together and plugging into Eq. (7) we 

find 



Var(A) = V shot + V sr 



V shot =i J d 2 6^ 2 (9)W 2 0)(^-+g 2 0) 



where 



d 2 e' ss't(\e-e'\), 



S(9) = ^ 2 (9)g0)W(9), 



(21) 
(22) 

(23) 
(24) 



and primed and un-primed variables are evaluated at 9' and 9 
respectively. We have split the variance of A into a shot-noise 
contribution V s h t, and a contribution from source clustering 
V src - Comparing Eq. (21) with Eq. (10) we see that lensing 
bias increases the shot noise in A due to the increased source 
density (but not as fast as it increases the value of A itself). In 
addition, the clustering of source galaxies adds to the variance 
of A. 

Fig. 2 shows the two contributions to the variance of A for a 
Gaussian filter [Eq. (12) in Section 2.1] as function of the filter 
scale 9. The thick lines show results including lensing bias 
(q = 1 .5), while the thin lines are without lensing bias. For the 
results with lensing bias, we have assumed that the estimator 
is centered on an NFW lensing halo of mass 3 x 10 14 Mq/H at 
redshift zl = 0.3. We see that source clustering is subdominant 
compared to shot noise for an average source density of n = 
20 arcmin -2 . For higher source densities, source clustering 
will become important. We also see that lensing bias increases 
both sources of noise, with V src being affected more strongly. 

Finally, we consider one more source of variance for shear 
estimators: that from the lensing field induced by large-scale 
structure itself. Generally, massive dark matter halos or den- 
sity peaks reside on average in overdense regions. This asso- 
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Figure 2. Contributions to the variance of smoothed shear estimators as a 
function of filter scale for a Gaussian shear filter: shot noise Oghot = \Ashot 
[Eq. (22)], source clustering noise cr src = \/Vsrc [Eq. (23)], and large-scale 
structure variance <tlss (Appendix B). <r s i TOt and <j sn; are shown with (thick 
lines; q = 1.5) and without (thin lines; q = 0) lensing bias, while olss is only 
shown including the very small lensing bias correction (again for q — 1.5). 

dated large-scale structure (LSS) can add to or subtract from 
the lensing signal of the halo itself. Unfortunately, calculating 
this contribution properly is only possible using N-body sim- 
ulations (especially when taking into account lensing bias). 
What we can calculate however is the estimated variance of 
A induced by uncorrelated large-scale structure, ctlss, as de- 
tailed in Appendix B (see also Hoekstra 2001). The result is 
shown as dotted line in Fig. 2, again for the Gaussian shear fil- 
ter. Clearly, large scale structure noise is subdominant for this 
filter as long as 9 is not very large; still, for 9 > 3 arcmin, it 
cannot be neglected. Furthermore, the magnitude of ctlss de- 
pends on the type of filter chosen: for the KS-Gaussian filter, 
the large-scale structure noise is much more significant, at the 
percent level for 9 < 1 arcmin. 

2.3. Another Shear Estimator 

An alternative to Eq. (2) as choice of smoothed shear esti- 
mator uses a location-based normalization: 



B(9): 



E,-g.-w(g.-g) 

J2iW(9i-9) ' 



(25) 



where both sums run over all galaxies. The normalizing de- 
nominator removes some of the fluctuations in the source 
galaxy density (which can be intrinsic or survey-specific, such 
as varying depth of the observations). We can write Eq. (25) 



B(9): 



A(9) 



N(9) 

iV((9) = i VuiWiAQ 

n — ^ 



(26) 



= d 2 9'fi q/2 (9')[l+6(9')]W(9-9'), (27) 
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Figure 3. Top panel: signal-to-noise of estimators A and B without lensing 
bias for a lens halo of mass M at zl = 0.3, using a Gaussian shear filter and 
a KS-Gaussian filter (see Section 2.1). Bottom panel: Change in signal-to- 
noise of A and B induced by lensing bias, for the same filters and lensing halo. 

where in the last line we have employed the continuum limit. 

It is worth pointing out some differences between A and B 
and their response to systematic effects due to source cluster- 
ing. For instance, fluctuations in the number density of galax- 
ies behind the lens are canceled out in B, while they contribute 
to A. On the other hand, if there is an overdensity of galax- 
ies associated with the lens or in the foreground (present in 
the sample for example due to uncertainties in photometric 
redshifts), these galaxies systematically reduce the value of B 
since they contribute zero shear to the numerator in B, despite 
being included in the denominator. This so-called dilution 
(Bernardeau 1998; Medezinski et al. 2007) does not directly 
affect the estimator A. A detailed discussion of the different 
systematics induced in both estimators by photometric red- 
shift uncertainties can be found in paper I. 

When calculating the statistics of B, we face the problem 
that (A/N) ^ (A) I (N) (see also paper I). However, we can 
employ this approximation if the effective number of galax- 
ies used in the shear estimator, ~ «6 2 where 9 is the filter 
scale, is much larger than unity. In this case, we have for the 
expectation value 

(N) fd 2 6[ii/ 2 (6)W(9) ' 

where we have again assumed that the shear is uncorrelated 
with source density. We see that the effect of lensing bias on 
B is partially canceled by the denominator, and therefore the 
expectation value for B is only weakly dependent on lensing 
bias corrections. In the absence of lensing bias and systemat- 
ics, (B) = (A). 

When calculating the variance of B however, it is necessary 
to take into account the covariance between numerator and 



denominator. The full expression for Var(B) is derived in Ap- 
pendix C, and is given in equation C4. The gist of it is that 
fluctuations in the number of source galaxies, due to both shot 
noise and source clustering, are partially canceled out (but see 
below). This becomes especially important for high masses, 
where the source clustering contribution to the variance be- 
gins to dominate. Finally, the large-scale structure variance is 
the same for both estimators, assuming that it is dominated by 
the weak lensing regime (lowest order in k, 7). 

The top panel of Fig. 3 shows the average expected signal- 
to-noise v, defined as v = (A) /[Varti)] 1 / 2 for both A and B, 
as a function of halo mass. We have again assumed an NFW 
lensing halo, and we show results both for a Gaussian shear 
filter of width = 3' and a KS-Gaussian filter with 6 = 0.7' 
(the choices of G will be justified in Section 2.5). Further, we 
have ignored lensing bias for the moment. For the Gaussian 
filter, we find that B is superior to A in terms of signal-to-noise 
for massive halos. For lower mass halos, where g 2 <C c 2 , both 
estimators are equivalent. Similar conclusions hold for the 
NFW-optimized filter. 8 

The KS-Gaussian filter shows a very different behavior: 
here, B has significantly less signal-to-noise than A at all rel- 
evant masses (in fact, v{B) never exceeds ~ 1.4), while A 
performs very similarly to the corresponding Gaussian esti- 
mator. The reason is that the covariance between the numer- 
ator and denominator of B, which leads to the partial can- 
celation of fluctuations for the other shear filters, is strongly 
suppressed for a KS-Gaussian filter. This is a consequence 
of the inclusion of very large scales in the filter (see Ap- 
pendix C). Without significant covariance between numerator 
and denominator, B is just the ratio of two noisy quantities, 
and not surprisingly has less signal-to-noise than A. However, 
as mentioned in Section 2.1, this filter is commonly used on 
a pixelized shear map, rather than on the galaxies directly. 
Hence, Eq. (25) is actually not used with the KS-Gaussian 
filter in practice. Nevertheless, this result illustrates an im- 
portant point: the choice of estimator (e.g., A vs B) and filter 
function W has to be done jointly, as the two are interrelated. 

2.4. Impact of Lensing Bias 

We now turn to the impact of lensing bias on A and B. 
Lensing bias changes both the value (signal) of smoothed 
shear estimators, as well as the signal-to-noise. The latter ef- 
fect is shown in the bottom panel of Fig. 3. Let us discuss 
the Gaussian filter first. We see that for M < 3 x 10 15 M o //i, 
lensing bias tends to increase the signal-to-noise in both es- 
timators, an effect which increases with mass. The effect is 
in fact greater for B than for A, even though the effect on the 
value of B is smaller. This is because lensing bias acts to 
decrease the variance of B in two ways: first, the increased 
source density reduces shot noise; second, lensing bias in- 
creases the covariance between numerator and denominator, 
improving the cancelation of fluctuations in the galaxy num- 
ber above that without lensing bias. In case of A, we see a re- 
versal of the effect at very high masses: lensing bias in fact re- 
duces the signal-to-noise of very high peaks for A. This is be- 
cause for very high halo masses (which are necessary to pro- 

8 In paper I, we found that both types of estimators are statistically equiv- 
alent. This is because, first, source clustering is negligible for the thin annuli 
considered in the stacked weak lensing context; and second, for the relevant 
radial scales, the stacked analysis is in the regime of g 2 <C a 1 . 
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Figure 4. Estimated mass bias when neglecting lensing bias (top) and rela- 
tive error (bottom) on the mass measurement using filtered shear estimators 
A and B. In the top panel, the thick lines show the mass bias expected if the 
signal itself is used to estimate the halo mass (thick lines), and if the signal- 
to-noise is used for the estimate (thin lines). In all cases, we have assumed a 
lens redshift of zl = 0.3, and a Gaussian shear filter with = 3'. 

duce this high signal-to-noise), source clustering takes over 
as the dominant source at this filter scale. While the source 
clustering noise scales similarly with the lensing quantities as 
the signal itself (oc /i 9//2 g), it is weighted by W 2 rather than 
W; this weighting favors smaller radii where the lensing mag- 
nification is stronger (see Fig. 8 in Appendix A), thus boost- 
ing the noise more than the signal. Note however that for 
such halos the optimal filter scale is significantly larger than 
the assumed 3 arcmin. On the other hand, fluctuations due 
to source clustering are largely canceled out in B, and we do 
not see this reversal for this estimator. Instead, the boost in 
signal-to-noise grows strongly towards larger halo masses. 

Again, the KS-Gaussian filter behaves differently: a signif- 
icant increase in v{A) is seen, while the effect on v{B) is a 
small decrease. The causes for the differences are, for A, that 
source clustering is somewhat smaller for the KS-Gaussian 
filter than for the Gaussian filter, and for B, that lensing bias 
increases the variance of the estimator slightly faster than it 
increases the value of B itself. 

It is also interesting to consider what impact lensing bias 
has on the mass that one would assign to each shear peak. 
Assuming one can predict a relation A = A(M), then from the 
amplitude of the shear peak one can estimate its mass. If, 
however, one fails to account for the boost in signal due to 
lensing bias, one will systematically overestimate the corre- 
sponding halo mass. 9 The systematic mass offset is approxi- 
mately given by 



d\nM AA 



(29) 



9 Note that in order to predict a mass-shear-peak relation one needs to 
assume a lens redshift as well as a profile shape, i.e. halo concentration. 
Here, we assume the true values of zl and c are known. 



where the logarithmic slope (ilnA/if InM ss 0.5-0.7 depend- 
ing on mass, and correspondingly for B. The top panel of 
Fig. 4 (thick lines) shows the relative bias AM /M obtained 
with a Gaussian filter with = 3'. Clearly, for the most mas- 
sive halos biases as large as tens of percent are possible. Note 
that the bias in B is smaller than that of A. This is because 
we are only relying on the amplitude of B to estimate a clus- 
ter's mass, and as we have seen, the amplitude of B is less af- 
fected than that of A. The thin lines in the top panel of Fig. 4 
show the expected bias if one would instead use the signal- 
to-noise of a lensing peak to estimate the halo mass, calcu- 
lated using a similar relation to Eq. (29), but for iy(A),i/(B) 
instead A, B. Here, the situation is reversed at high masses: 
the mass bias is larger for B than for A, reaching order unity 
at M « 10 15 M q //l Moreover, the bias in A changes sign at 
very high masses (for a fixed filter scale). This reflects the 
behavior of the lensing bias effect on v seen in Fig. 3. 

Turning now to the statistical uncertainty in the recovered 
mass, we have seen that lensing bias helps increase the signal- 
to-noise of a given halo. Thus, we expect properly accounting 
for lensing bias will reduce the statistical error in cluster mass 
estimates. The expected error in log-mass can be estimated as 



ClnAf - 



<ilnM 1 
din A v k 



(30) 



This is shown in the bottom panel of Fig. 4. Again, the im- 
provement in mass resolution becomes relevant at the high- 
mass end. We also see how the better statistical performance 
of B over A for this filter reflects in the smaller mass uncer- 
tainty at the high-mass end. Note that this should only be seen 
as rough estimate; in practice, halo triaxiality and projection 
effects will increase the error in the recovered masses signifi- 
cantly. 

2.5. Choice of Filter Scale 

In order to optimize the filter shape, one has to adopt a met- 
ric with which different filtrs can be compared. In the follow- 
ing, we will focus on the goal of maximizing the signal-to- 
noise for a given lensing halo, which is most directly relevant 
to shear peak counts. 

In order to test the relative performance of the different 
filters, we show the signal-to-noise v for a 3 x 10 14 M Q //! 
lensing halo at fixed lens redshifts of zl = 0.3, 0.6 in Fig. 5, as 
a function of the filter scale 0. In case of the matched-NFW 
filter, = f^max is the truncation scale of the filter. We use A 
in all cases. The optimal signal-to-noise is determined by a 
balance between the signal A, which decreases with increas- 
ing filter size, and the noise which also decreases. We see that 
all three filters peak at roughly the same signal-to-noise for 
A, though the filter scale at which this peak is reached is very 
different. For the Gaussian filter, the optimal case is around 
9 w 3', which we have thus adopted as default value of the fil- 
ter scale. For the KS-Gaussian, the optimal value is 9 w 0.7'. 
Note that these values depend on the mass and redshift of the 
lens. 

The relative importance of the three noise contributions, 
shot noise, source clustering, and LSS noise, depends on the 
filter shape. In case of the KS-Gaussian filter, the LSS noise 
is much more significant, a factor of ~ 10 higher at the opti- 
mal filter scale than for the other two filters. Again, this is a 
consequence of the inclusion of very large scales in this filter. 
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Figure 5. Signal-to-noise of the estimator A as function of filter scale for 
different filter shapes (truncation scale in case of the matched NFW filter), 
for a 3 X 10 14 Mq//i halo at two different redshifts: zl = 0.3 (top panel) and 
zl - 0.6 (bottom panel). The thin lines show results without lensing bias, 
while the thick lines include lensing bias (q = 1.5). 



This will also be of relevance to the contribution of correlated 
large-scale structure to smoothed shear estimators. 

Fig. 5 also shows the effect of lensing bias (thick lines vs 
thin lines): for this halo, lensing bias boosts the peak signal- 
to-noise by ~ 5% for the Gaussian and NFW-optimized fil- 
ters. In case of the KS-Gaussian, the effect is slightly smaller, 
~ 3%, due to the preference for large scales in that filter. 
Note that the optimal filter scale is moved to slightly smaller 
values by lensing bias. This is because the signal in A in- 
creases more steeply with decreasing filter size when includ- 
ing lensing bias. 

Before deciding on an optimal filter, however, it is neces- 
sary to take into account the effect of associated and coin- 
cidental large-scale structure along the line of sight to the 
lens. Clearly, the likelihood of a chance superposition with 
unrelated matter concentrations with the filter scale (Hoekstra 
2003; Maturi et al. 2005). This might make a true optimal 
filter narrower than suggested by Fig. 5. 

3. THE PEAK FUNCTION 

We now turn to investigating the impact of lensing bias and 
filter choice on the abundance of detected shear peaks. To do 
so, we assume each shear peak corresponds to an NFW halo 
and that the peak position is the halo center. While in practice 
one expects some fraction of weak lensing peaks to arise due 
to chance superpositions of multiple halos, the results we ob- 
tain concerning how lensing bias impacts weak lensing peak 
finding should be indicative of the whole population. 

We estimate the average number of lensing peaks within a 
solid angle f2 s above a given signal-to-noise threshold 1% as 




4 6 8 10 

Figure 6. Upper panel: Average number peak of peaks Ap ea k(> fth) (P er 
deg 2 ) above the signal-to-noise threshold v$i, with and without lensing bias 
for the two estimators A, B. We set = 3 arcmin. Lower panel: Relative 
lensing bias effect on the peak counts, AA r pca | < (> fth)/A'peak( t 'th)- 

follows: 

iW>%)=n, / ^txHzl) (3D 

x / dlnM — — ( ZL ,M), 

JlnM mm (ZL^k) dlnM 

where c is the speed of light, H(z) is the Hubble expansion 
rate, x(z) is the comoving distance to redshift z, dn/dlnM is 
the halo mass function, and M m ; n is defined via 

"(M^Zl) = (32) 

Hence, lensing bias enters by lowering the effective z- 
dependent mass threshold of the survey, so that the number of 
peaks is increased. We use the fitting function of Tinker et al. 
(2008) to calculate dn/dlnM as function of mass and redshift 
from the linear matter power spectrum (recall that M = M2oom 
throughout). 

Fig. 6 shows the peak count statistics Af pe ak(> ^th) (peaks 
per deg 2 ) with and without lensing bias using a Gaussian fil- 
ter with fixed filter scale G = 3 arcmin. We also show the 
relative lensing bias effect ANpe^/N^k. Evidently, estimator 

B is more efficient at finding peaks for this filter, with or with- 
out lensing bias: the number of high signal-to-noise peaks 
(y > 6) is higher than that found by A by orders of magnitude. 
Further, lensing bias can boost the weak lensing peak counts 
significantly for both A and B, by more than a factor of 2 for 
high peaks. At high thresholds > 8.5, we see a turnover, 
which reflects the trend seen in Av (Fig. 3). 

Note that for very high peaks, i.e. massive halos, a filter 
scale of 3 arcmin is not optimal. Hence, it also interesting 
to consider the number of peaks above a fixed threshold as 
function of the filter width (Fig. 7). The different number 
of peaks here reflects the different mass thresholds for dif- 
ferent filter scales at a fixed signal-to-noise threshold. Again, 
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Figure 7. Upper panel: Average number peak of peaks N^.^(u > 5) (per 

deg 2 ), with and without lensing bias for the two estimators A, B as a function 
of the filter scale ©. Other survey specifications as in Fig. 2. Lower panel: 
Relative lensing bias effect on the peak counts, AJV pca k(^ > 5)/N V zjk(v > 5). 

the location-normalized estimator B yields more peaks than A, 
and lensing bias increases the number of 5a shear peaks found 
with either estimator by ^30%. Note that since lensing bias 
pushes the optimal filter scale to smaller values, the number of 
peaks increases faster with scale when incorporating the im- 
pact of lensing bias. Not surprisingly, we find that choosing 
smaller filters can increase the relative importance of lensing 
bias significantly (see the lower panel of Fig. 7). 

Finally, we note that the result of the comparison A vs B 
reverses for the KS-Gaussian filter: A performs far better for 
this filter, as expected after Section 2.3. 

4. SUMMARY AND DISCUSSION 

We have compared the expected signal-to-noise of the two 
most common types of filtered shear estimators for different 
filter functions, and studied how lensing bias impacts these 
estimators. In our signal-to-noise considerations, we also in- 
clude the effects of the intrinsic clustering of source galaxies, 
and the variance of the estimators due to uncorrelated large- 
scale structure. The former noise contribution becomes im- 
portant for massive lensing halos, while the latter's impor- 
tance only depends on the filter shape and scale considered. 

We find that estimator and filter function need to be cho- 
sen jointly and cannot be regarded as independent. For ex- 
ample, for the Gaussian shear filter, the location-normalized 
estimator (B) is statistically superior to the globally normal- 
ized estimator (A) for high peaks. This is because fluctuations 
in the number of galaxies are canceled out to first order in B. 
For the KS-Gaussian filter on the other hand, the situation is 



reversed: A performs far better than B. According to our (cer- 
tainly not exhaustive) results, a location-normalized estima- 
tor with a Gaussian-type shear filter appears to perform best 
statistically. The question of optimal filter+estimator combi- 
nation certainly deserves more attention, as the abundance of 
high peaks can be suppressed by orders of magnitude for sub- 
optimal choices (Fig. 6). 

Another finding, of equal importance to their statistical 
properties, is that the two types of estimators respond very 
differently to uncertainties in the photometric redshifts (see 
the discussion in paper I). Specifically, the globally normal- 
ized estimator A does not suffer from the so-called dilution 
effect affecting B, and should be much less sensitive to con- 
tamination of the source sample by galaxies associated with 
the lens. 

Turning to lensing bias, we find that it affects both of the 
estimators we considered, and for all filter functions. While 
the signal-to-noise of either estimators can be boosted signifi- 
cantly (up to Ai/ ^1-2), the value of the estimator A is gener- 
ally affected more strongly than that of B. Indeed, if one were 
to estimate halo masses based solely on their smoothed shear 
signal, halo mass over-estimates as large as tens of percent 
are possible. Not surprisingly, the increase in signal-to-noise 
especially of high peaks also results in comparable boosts to 
the abundance of observed peaks. The magnitude of the ef- 
fect depends strongly on the filter scale as well: smaller filter 
scales lead to a much larger boost in signal due to lensing bias, 
pushing the optimal filter scale towards smaller values. 

While we have not considered the impact of halo triaxial- 
ity and correlated structures in our work, it is straightforward 
to understand how these can affect our conclusions. Specifi- 
cally, both of these sources of noise tend to increase the weak 
lensing shear signal, and therefore will increase the relative 
importance of lensing bias. Moreover, if one wishes to min- 
imize line-of-sight projections of multiple halos, we expect 
doing so will require smaller filters, resulting in a further in- 
crease of the importance of lensing bias. 

In light of these results, it is clear that a proper modeling 
of lensing bias as well as source clustering will be a neces- 
sary component of cosmological interpretations of the shear 
peak function. Furthermore, these effects should also be taken 
into account when designing an optimal estimator+filter com- 
bination for shear peak finding. Fortunately, incorporating 
these two effects is fairly straightforward, both for analytic 
calculations and numerical studies with N-body simulations. 
Lensing bias, by increasing the signal-to-noise of the shear 
signal, has the potential to significantly boost the statistical 
power of shear selected samples of objects. Thus, properly 
accounting for lensing bias should allow us to maximize the 
cosmological potential of shear peak statistics. 

We would like to thank Richard Ellis, Richard Massey, 
James Taylor, and Scott Dodelson for helpful discussions. 
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Figure 8. Left panel: Profile of convergence k, tangential shear 7, tangential reduced shear g and scaled magnification *JJi- 1 around a halo of mass 3 X 
10 14 Mq//j located at zl = 0.3. Right panel: Reduced shear and magnification weighted by area and the Gaussian filter W, for the same parameters as in the left 
panel. This shows which scales contribute to the signal in Eq. (2) and Eq. (25). The thin lines show the results capped at k = 0.5 (see text). 



APPENDIX 

A. LENSING BY AN NFW HALO 

In this appendix we review our lens model used for the numerical results. The Navarro-Frenk- White density profile is given by 
(Navarro et al. 1997): 

, , Ps M 

P(r) ~ r/r s (\+r/r s ) 21 Ps ~ 47Tif|Tn(l + c)-c/(\ +c)] ' ( j 

where r s is the scale radius, and c = R200 /r s is taken from the fit of Bullock et al. (2001). Note that this halo profile is not truncated 
at ^200- Actual halo profiles lie somewhere in between the extrapolated NFW profile adopted here and a truncated profile. The 
differences however appear mainly around R200, which is typically larger than the filter size we will consider. Hence, the details 
of the outer halo profile do not change the results significantly. 

Further, we make the small angle approximation. The lensing quantities n and 7 for an NFW halo have been derived in the 
literature (Oaxaca Wright and Brainerd (1999); Maturi et al. (2005)). For reference, the convergence at the scale radius is given 
by 

Ks= P^3Ps^ (1+Zl)Wl(zl) (A2) 

Here, zl is the halo redshift, p cr o is the critical density today, c is the speed of light, and the lensing weight function is given by 

c f°° yi dN 
W l (zl) = tttt / dz s " Xl] -z- , (A3) 

where xl = xfocX and dN/dz is the normalized redshift distribution of the source galaxies, Eq. (1), determined from a fit to the 
expected redshift distribution of galaxies for DES (Annis 2009). 

Fig. 8 (left panel) shows the profile of k, 7, g as well as p for a typical detectable lensing halo at zl = 0.3. While k, 7 are well- 
behaved, both g and p show the strong-lensing caustics, as expected. For this halo and source/lens redshifts, the strong-lensing 
regime defined by g(6) ~ 1 is relevant for 9 < 0.05'. This regime cannot be treated by weak lensing analysis techniques, so it 
should be removed. Fortunately, the bulk of the signal-to-noise for weak lensing is at much larger radii. The right panel of Fig. 8 
shows the relevant lensing quantities weighted by the Gaussian filter Eq. (12) for 9 = 3 arcmin. Clearly, the bulk of the signal 
comes from 9 ~ 0.3-3 arcmin, well outside the strong lensing regime for this halo. Although not of practical relevance, very 
massive halos do appear in the calculations where the strong lensing regime extends to beyond 0.2 arcmin. For those cases, we 
cap the convergence at k = 0.5. This is merely done to lead to convergence of the calculation, and only occurs for such high halo 
masses that it does not impact the results presented here. 
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B. LARGE-SCALE STRUCTURE VARIANCE 

Here we re-derive the variance of A due to large scale structure in the survey (see Hoekstra 2001). Since the bulk of the survey 
area will have small convergence, we can work in the weak-lensing limit, in which case A and B are equivalent. Furthermore, 
we only calculate the leading contribution from the power spectrum of the lensing field, neglecting higher order moments which 
in principle can become relevant on very small scales. The computation is most conveniently done in Fourier space. For this, 
we first rewrite Eq. (6) in the weak lensing limit (g = 7) as as an expression for the filtered convergence k (e.g., Bartelmann and 
Schneider 2001): 

A(9) = J d 2 9'K(6')W K (\9-9'\) (Bl) 
f°° d9' 

W K (9) = 2 / — W(6')-W(6). (B2) 
Je " 

Note the non-local relationship between the shear filter and the corresponding convergence filter. Since K is a scalar quantity, 
Eq. (Bl) can be straightforwardly written in Fourier space: 

A{£) = h{£)W K {£), (B3) 

where tilded quantities stand for Fourier transforms: 

X(£) = d 2 9e ilS X{9). (B4) 



°l SS = (A 2 )= I t^2\W k (£)\ 2 C*(£). (B6) 



Then, using the definition of the angular power spectrum of the convergence: 

(k(£) K (-£')) = (2n) 2 S D (£- £')C K (£), (B5) 

we can write the first-order variance of A as: 

dh 

(2tt) 2 

This quantity is shown as dotted line in Fig. 2, for a Gaussian shear filter Eq. (12) (note that W K for this filter is not a Gaussian). 
It is also straightforward to calculate the leading lensing bias correction to olss, following the second order correction to C K (£) 
presented in Schmidt et al. (2009b). While we include this correction, the relative change of ctlss only amounts to a few percent 
for filter scales 6 > 1' (see also White (2005)). 

C. VARIANCE OF THE LOCATION-NORMALIZED ESTIMATOR 

In this section, we derive the variance of B due to shot noise and source clustering, taking into account the covariance between 
numerator and denominator. We expand B =A/N around its expectation value, (B) = (A)/(N), assuming that fluctuations are 
much smaller than unity (justified if n 6 2 ^> 1). In close analogy to the derivation in the appendix of paper I, this yields 

Var(fi) _ Var(A) | g Var(A>) <| Cov(i, A>) 

(B) 2 (A) 2 (AO 2 (A) (iV) ' 

where 

Cov(A,A>) = (AN)-(A)(N) = ^ J d 2 9W 2 (9)^/ 2 (9)g(9) + J d 2 9 J d 2 9' W n ql2 W' ii' ql2 g' £(\9' -9\) (C2) 

Var(A>)=i / d 2 9W 2 (9)^l 2 (9) + J d 2 9 J d 2 9' W ^ q/2 W ^ q/2 ^(\9-9'\). (C3) 

Here, we have used the shorthand notation introduced after Eq. (24). In each of these equations, the first term denotes the shot 
noise contribution, while the second is the contribution from source clustering. We see from Eq. (CI) that there is a cancelation 
of noise terms (both shot noise and source clustering) due to the positive covariance between A and N. The cancelation is not 
perfect, since the various terms involve different integrals over shear and magnification. However, for relatively low-mass halos 
for which g 2 <C c 2 , the shot noise term of Var(A) [Eq. (22)] dominates in Eq. (CI). In this limit, we recover Var(B) = Var(A) (see 
Fig. 3). 

If we neglect lensing bias and source clustering in both B and A, so that (A) = (B), Eq. (CI) simplifies to 

- (A) 2 / Afd 2 9W 2 g\ 
Var(B) = Var(A) + Vj- 3 -4 j — , (C4) 



fiA V Jd 2 9Wg 

where we have defined A= 1/ J d 2 9W 2 as the effective filter area. If the shear g was constant, then the last term in Eq. (C4) 
would evaluate to 4, canceling the g 2 term in Var(A) [Eq. (22)] and leading to Var(B) = cr 2 /(2«^4). Eq. (C4) makes it clear 
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that the last term determines whether B performs better or worse than A: if A J W 2 g/ J Wg is order unity, Var(fi) < Var(A). If 
it is much less than 1, Var(B) can be significantly larger than Var(A). The latter is the case for the KS-Gaussian filter, where 
A fW 2 g/ JW g w 0.2-0.3, due to the preferential weighting of very large scales. Similar reasoning applies to the source 

clustering contribution. This explains why B performs worse than A for the KS-Gaussian filter. Note that this effect will be 
mitigated when employing this filter on a pixelized shear map: since the sum in Eq. (25) now only runs over galaxies within 
a small pixel (with W = 1) instead of running over all galaxies, there will be significant covariance between numerator and 
denominator, reducing the noise in the pixelized shear. 
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